Transmission of information via the non-linear Scroedinger equation: 
The random Gaussian input case 
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The explosion of demand for ultra-high information transmission rates over the last decade has 
necessitated the usage of increasingly high light intensities for fiber optical transmissions. As a 
result, the fiber non-linearities need to be treated non-perturbatively. Similar analyses in the past 
have focused on the effects of non-linearities on existing transmission technologies, e.g. WDM. 
In this paper we take advantage of the fact that, under certain assumptions, light transmission 
through optical fibers can be described using the non-linear Schroedinger equation, which is exactly 
integrable. As a particular example, we show that in the low Gaussian noise limit, the Gaussian 
input distribution has a higher mutual information than the transmission using WDM over the same 
available bandwidth. 



INTRODUCTION 

The possibility of light soliton propagation in a sil- 
ica fiber was first predicted by Hasegawa and Tappert 
m 1973 Since then a tremendous amount of work 

has been done on both theoretical and experimental as- 
pects of light soliton propagation. Moreover, optical fiber 
technology has become the centerpiece of wired telecom- 
munications, with optical fiber networks crossing oceans 
and webbing continents, carrying the world's digital com- 
munications in the form of light soliton pulses. 

The property of the silica fiber that makes soliton prop- 
agation possible is the Kerr nonlinearity, i.e. the depen- 
dence of the index of refraction n on the intensity of light 
/: 



n = no{uj) + 712/. 



(1) 



From this one can straightforwardly derive the equation 
that governs the propagation of light inside the fiber, the 
non- linear Schrodinger equation (NLSE): 



,du d^u 
dt dx^ 



+ 2k\u\'^u = 



(2) 



where u{x,t) is the envelope of the electric ficld^ and 
K = ±1. The positive value of k, describing light prop- 
agation in the anomalous dispesion regime, gives the at- 
tractive NLSE that admits solitonic solutions with zero 
boundary conditions (bright solitons). For k = — 1 we 
get the repulsive NLSE, which is valid in the normal 
dispersion regime and admits solitons for nonvanishing 
boundary conditions (dark solitons). For either sign of 
At, the NLSE is an integrable hamiltonian system. It 
belongs to a class of nonlinear equations (together with 
the KdV equation, the sine-Gordon equations and oth- 
ers) that can be solved exactly by means of the inverse 



scattering transform (1ST) technique (see e.g. [^, |3| and 
references therein) . 

Optical fiber channels are capable of extremely high 
data transfer rates and constant technological improve- 
ments have led to an almost exponential increase in real- 
world transmission rates. The question naturally arises 
then, what is the upper bound imposed on the bit rate by 
the physics of the optical fiber, irrespective of any par- 
ticular technological setup! The natural framework in 
which one can address this question is that of informa- 
tion theory, developed by Shannon . In any communi- 
cation channel, the limiting factor for the rate at which 
it can carry information is the noise that unavoidably 
enters along the channel and corrupts the data. Shan- 
non introduced the concept of channel capacity, defined 
as the maximum possible bit-rate for error-free transmis- 
sion. The channel capacity C is defined by: 



C = maxp^ [H[y]-{H[y\x])^^} 



(3) 



where y is the output (received) signal, x is the in- 
put (sent) signal, p^ is the probability distribution of 
"symbols" in x (be it letters, fourier components, soliton 
modes or what it may), and H is the entropy of informa- 
tion: 



H[x] 



dxpx log Pa; 



(4) 



The maximum in ([3|) is taken among all possible input 
distributions px- The two quantities that ultimately de- 
termine the performance of the channel are the signal to 
noise ratio (SNR) for the received signal and the band- 
width. For a linear channel (e.g. a copper wire) with 
additive noise. Shannon's celebrated result 0] states 
that 



^ For light propagation inside a fiber, t in l(2j denotes position 
along the fiber and x denotes time in the comoving frame. 



C = Wlog(^l + pj, 



(5) 
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W being the bandwidth^ and S, N the average power of 
the signal and the noise respectively. 

However, modern fiber-optics systems operate in a sub- 
stantially non-linear regime, rendering the assumption of 
linearity used to derive ([5]) invalid. The additive noise in 
fiber-optics systems comes from periodically spaced am- 
plifiers (usually erbium-doped segments of fiber) that off- 
set the loss in the electric field amplitude along the fiber'^. 
These inject noise into the signal, mainly because of am- 
plified spontaneous emission of photons ( ASE) [13, [l^ ■ 
The initial approach to the effects of noise was to de- 
termine how the additive amplifier noise perturbs sin- 
gle solitons, and calculate the jitter introduced into the 
soliton trains. The result is the so called Gordon-Haus 
jitter [3, 121- For both dark and bright solitons, one 



finds that the perturbation in the frequency (which is 
also proportional to the velocity of the soliton) is a zero- 
mean gaussian with variance proportional to the strength 
of the amplifier noise and the amplitude of the soliton. 
This results in a jitter in the soliton arrival times that 
can cause reading errors at the receiver. A lot of work 
has been done on overcoming the restrictions in bit rate 
due to this effect^. The mechanisms proposed typically 
involve some form of optical filters or dispersion compen- 
sation, such as sliding- frequency filters [l8[ , syiichronous 
modulation |3, optical phase conjugation (iol- 22|. and 



dispersion managed solitons [2i,[2|| (there are numerous 
papers on the sub ject of suppressing the Gordon-Haus 
effect, see ch.l2 of [15| for a partial list of references). 

This approach is useful when one is considering specific 
signaling schemes that use soliton trains of fixed ampli- 
tude and inter-soliton distance, but to answer the ques- 
tion of maximum achievable channel capacity we need to 
abandon the single-soliton assumption and venture out 
towards a more abstract and general method. In Shan- 
non's theory of linear channels, for an input with a given 
average power one can show that the the maximal distri- 
bution in ([3]) is the gaussian. For the nonlinear channel 
under study, the maximal distribution is much harder 
to find, but one can still obtain bounds that convey the 



^ The loss mechanisms for light propagating through silica limit 
to a maximum of 50 THz 7] . Systems in practical use at the 
moment have a 15 THz bandwidth. 

^ Besides additive noise, there is also multiplicative noise. This 
becomes especially important when one considers wavelength di- 
vision multiplexing (WDM) systems, in which the bandwidth is 
divided in a multitude of sub-bands (channels). Because of non- 
linearity, signals in different channels interact with each other. 
Because of the statistical independence of signals in different 
channels one can use a model in which each channel sees the 
rest of the bandwidth as multiplicative noise. Very interesting 
work in the direction of determining the impact of multiplicative 
noise on the capacity of fiber-optics channels has been done in 
id 

* For practical purposes, one needs to obtain a bit error rate lower 
than 10-9 



qualitative behavior of C as a function of the SNR by 
using a gaussian as input Starting with a zero- mean 
gaussian random electric field with given, constant sec- 
ond moment, we calculate the density of soliton modes 
produced by it inside the fiber. The soliton modes are 
the natural degrees of freedom to use, like the Fourier 
coefficients would be for a linear channel. The distribu- 
tion of (either dark or bright) soliton modes is calculated 
as the density of states (density of eigenvalues) of the 
linear operator associated with the NLSE in the context 
of the inverse scattering transform. For the NLSE, the 
associated linear problem is the 2x2 Zakharov-Shabat 
eigenvalue problem [25|] . The eigenvalue spectrum of the 
Zakharov-Shabat system includes both continuous and 
discrete eigenvalues. We are interested in the discrete 
eigenvalues, because they correspond to soliton modes, 
and these can be isolated by imposing zero boundary 
conditions on the eigenstates. One can then use adia- 
batic perturbation theory based on the 1ST to compute 
the statistical uncertainty introduced in the eigevalues by 
the amplifier noise. This leads to a different distribution 
at the output, from which the mutual information can 
be found, and the aforementioned bounds on the capac- 
ity C can in principle be computed. The implementation 
of adiabatic perturbation theory can only be done numer- 
ically in the dense soliton limit that we are examining. 



DARK SOLITONS 

For dark solitons, the associated linear eigenvalue 
problem is hermitian: 



Uh = 



0) 



d x 



(6) 



To isolate the soliton modes, we impose the boundary 
conditions ipi,'ip2 ^ a,s x ±oo. Hermiticity means 
that the eigenvalue A is real. For a single soliton, A would 
give both the velocity and the amplitude of the soliton. 
In the dense soliton case we are interested in, the eigen- 
values are the collective degrees of freedom of the soliton 
modes. The "potential" u{x, 0) is the initial condition of 
the NLSE ([2]) or, in the case of optical fibers, where 'x' 
denotes time, the envelope of the electric field that goes 
into the fiber. We choose u to be gaussian random, i.e. 



u{x) 



1 



V2 



{ui{x) + iU2{x)) 



(7) 



with wi,M2 real, {ui{x)) = 0, and (ui{x)uj{x')) = 
Ddijd{x — x') , — 1,2. The constraint on the sec- 
ond moments of Ui translates to a power constraint for 
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the ingoing electric field, specifically that it has average 
power D. The DOS of this operator with these bound- 
ary conditions has been known in the literature for some 
time ^2^-!29j. The DOS is constant, independent of A. In 
FIG. [T]we see the results of numeric simulations for the 
DOS. The simulation was done usiiig an adaptation of the 
modified Ablowitz-Ladik scheme [30| for the hermitian 
problem. Beyond determining the DOS, we want to know 
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FIG. 1: DOS of the hermitian Zakharov-Shabat eigenvalue 
problem as a function of the eigenvalue A. The eigenvalues 
were found by direct diagonalization, using an adaptation 
of the modified Ablowitz-Ladik scheme {D = 1, size = 20, 
step — 0.1, 200 runs). We see the independence of the DOS 
from A. 



Numerically, we find the distribution of the distances 
between neighboring eigenvalues. Statistical indepen- 
dence means that the distances must follow a Poisson 
distribution, and this is indeed what we find. The re- 
sults are shown in FIG. [5] 




FIG. 2: Logarithmic plot of the distribution of the distance 
between successive eigenvalues of the hermitian Zakharov- 
Shabat operator {D — 1, size = 100, step = 0.1, 100 runs). 
The linearity of the data shows that the distances follow a 
Poisson distribution, i.e. there is no bias in the magnitude of 
the distance, in agreement with our theoretical result about 
statistical independence of eigenvalues even in the dense soli- 
ton limit. 



whether the eigenvalues are statistically independent, i.e. 
whether p(A, A') ~ p{X)p{X'). We were able to show 
this both analytically and numerically. On the analyt- 
ical side, we used Halperin's method [3l|, |32| . We define 

the variables = arg(|ig), 0'{x) - arg(|fM). 
Their evolution along x is a Markov process and from 
we can derive the Fokker-Planck equation for their 
probability distribution P{x; 9, 9'): 



dP _ dP ,dP 
9^ ~ '89^ W 



D 



Q2p Q2p 



2D 



32 



8686 



89''- 

- (cos(6' - 6')P) . 



89^ 



(8) 



') 



We also derive the equation for the quantity F{x; 
(^§^5{9{x) - 9)5{9'{x) - 61')) (the brackets denote av- 
eraging over the gaussian ensemble of w's) 



8F 
8x 



-H 



D 



8'^F 8^F 



(9) 



2Dcos(9 - 



8^F 
' 8989' 



where H{x; 9, 9'), G{x; 9, 9') satisfy the same equation as 
P{x] 9, 9') but with ~P as an extra source term. F grows 
as yo(A, A') X x^ for large x and from (Pl. pHj) we are able 
to calculate p(A, A') and show that it factorizes. 



The effect of (weak) additive noise coming from peri- 
odic amplification on the eigenvalues can be studied in 
the context of adiabatic perturbation theory and the 1ST 
'ss']. It is thus shown that for white gaussian noise, the 
disturbance of the eigenvalues is gaussian, with zero mean 
and a variance (AA^^ proportional to the strength of the 
noise and the inverse participation ratio J dxWipW"^ of the 
corresponding (normalized) eigenstates. The inverse par- 
ticipation ratio (IPR)in the many soliton situation that 
we are interested in is beyond analytical treatment. In 
the next two figures we show the results of numerical sim- 
ulations where we calculated the IPR for the eigenstates 
of Uh found using direct diagonalization of Uh with a 
central differences discretization (in order to impose zero 
boundary conditions at the edges). 

In FIG. [3] we see how the IPR (locally averaged for 
smoothness) behaves as a function of the eigenvalue A 
for a given D. We are actually interested only in the 
middle, flat section, because the raising of the edges is 
an artefact of the central difference method that creates 
an over-concentration of eigenvalues near the edges of the 
spectrum. Averaging over this flat segment for different 
D's, we see in FIG.|3]the dependence of / (ia;||'(/'||^ on the 
average input power D. The linearity indicates that the 
IPR is actually proportional to the localization length 
(the inverse Lyapunov exponent) of Uh [29| . 
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FIG. 3: Inverse participation ratio as a function of the eigen- 
value A, for a given average power of the input (D — 1, 
size = 100, step = 0.1, 100 runs). 




FIG. 4: Inverse participation ratio, averaged over the eigen- 
values, as a function of the average input power D (size = 100, 
step = 0.1, 100 runs). 



BRIGHT SOLITONS 

The non-hcrmitian Zakharov-Shabat eigenvalue prob- 
lem is defined by the system of equations 



Uip{x) = zil){x), 



U = 



(10) 



on the infinite line, together with the boundary condi- 
tions — >■ as X — >■ ±oo. The star denotes com- 
plex conjugation, and the eigenvalue z = ^ -I- 177 is gen- 
erally complex. As in the hermitian case, the "poten- 
tial" u{x, 0) is the envelope of the electric field that goes 
into the fiber. We want to find the density of states 



(DOS) of this operator when u is gaussian random, as in 
([T]). The DOS p(^, 77) of this random operator determines 
the entropy of information carried by the gaussian signal 
through the formula I = — J dS,dr]p(^, rf) log p(^, 77). 

Contrary to its hermitian counterpart, this DOS is not 
known in the literature. Halperin's method [3l|, which 
works so nice for the hermitian/dark soliton case, fails 
here because of non-hermiticity. Non-hermitian random 
operators have received considerable attention in the lit- 
erature (see [13] for a list of important references on 
the subject). They have a wide range of applications, 
in non-equilibrium statistical mechanics, random clas- 
sical dynamics, the physics of polymers, QCD, neural 
networks, and, in the case at hand, soliton physics and 
communications. Non-hermiticity means that the eigen- 
values migrate to the complex plane. This complicates 
the computation of their statistical properties - most no- 
tably their DOS - relative to hermitian operators. The 
underlying reason for this is that the propagator in the 
hermitian case is analytic except on branch cuts of the 
real axis, where the real eigenvalues condense. This intro- 
duces constraints that facilitate the computation of the 
DOS. When the eigenvalues are complex this is no longer 
true. In many cases however one can obtain approximate 
results for the DOS. The hermitization method, devel- 
oped by Feinberg and Zee in ^35j can be applied here. 
The method gives the self-consistent Born approxima- 
tion (SCBA) to the density of states. Instead of looking 
at U directly, one starts with a "hermitized" operator 



H 







W - z* 



u 



(11) 







and calculates the propagator G = — 1^ in the SCBA. 
The DOS is then given by the derivative with respect to 
z* of the trace of the lower-left block of G. We omit the 
details of the calculation and state only the final result. 
We find the DOS to be uniform inside a uniform band 
centered around the ^ axis on the complex z plane, with 
the width being proportional to D: 




(12) 



To move beyond the SCBA, and see how the DOS 
"frays" near the edges of the band, we used the method of 
optimal fluctuations (s^ls^]. The basic idea is that states 
outside the band given by the SCBA are created by atyp- 
ically strong fluctuations of the "potential" u(x, 0). Such 
fluctuations occur with an exponentially small probabil- 
ity exp [— where 



W=^ I dx\uix,0)f 



(13) 



Minimizing the functional W[u] in (|13l) with respect to 
u, with the additional constraint det(J7 — z) = enforced 
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as a Lagrange multiplier, we can determine the density of 
states outside the band with exponential accuracy. The 
minimization involves the solution of a pair of coupled 
nonlinear ordinary differential equations and the result 
for the optimal potential is: 



u(x) = Airj 



(14) 



From (|13ll4p we then find that to exponential accuracy, 
the DOS for \r]\ » D is 



(15) 



Although these approximate methods provide some 
measure of knowledge of the DOS (and consequently the 
entropy of information of the gaussian random signal) , an 
exact expression is always desirable, for obvious reasons. 
We were able to obtain such a result using a variation 
of the Thouless formula 38|, that relates the DOS with 
the Lyapunov exponent of the operator U in ([T0| . The 
Lyapunov exponent A measures the rate of exponential 
increase of the modulus of a typical solution of ^TU\\ as 
one goes towards larger x with no boundary conditions 
are imposed. One possible definition is: 



A= lim -ln(\Mx)\^ + \M^W 



(16) 



We were able to show that for any set of finite boundary 
conditions, a unique, positive Lyapunov exponent exists 
and it is related to the DOS by the formula 



1 

2^ 



(17) 



The eigenvalue problem (jlOp has enough symmetry to 
make the exact calculation of A (and from this, p) possi- 
ble. The details of this calculation are contained in the 
attached paper (together with the detailed proof of (fT7|) 
and the relevant references) which has been published in 
Phys. Rev. E 39]. The expression we arrive at for the 
DOS is: 



2 ^coth(f)-l 



^^^'^^ irD sinh^fH^) 



Kd) 



(18) 



We have also been able to show that ((T8|) remains valid for 
large ^ in the case of a purely real gaussian random po- 
tential that maximally breaks rotational symmetry (com- 
pletely polarized input). The comparison of this predic- 
tion with numerical simulations is shown in FIG. [S] 

This description of the input signal in terms of the 
density of ^ and 77 is complete only in the cases where 
one can neglect the effects of jitter in the positions of 
the solitons and concentrate on the shift in the eigen- 
values as the primary cause of signal distortion. If we 
want to include the positional jitter, we must take into 




FIG. 5: Theoretical curve (solid line) and results of numerical 
simulations for the profile of the DOS vs. 77. We have used the 
modified Ablowitz-Ladik diagonalization scheme [30|. The 
value of D is 1, the size of the system is L = 135 and the 
step size is 0.075. The disturbance near = is a finite-size 
elTect. The localization length I = grows as I ~ 'iD/2'if 
near r) = Q and so numerical results become unreliable for 
1^1 ^ ^/dJL (- 0.1 here). 



account the information contained in the positions of the 
solitons. This is done by considering another set of com- 
plex numbers b^, one for each solitonic excitation, which 
are associated with soliton positions ^ and together with 
the eigenvalues fully determine the solitonic part of the 
signal. Since we study the effects of noise on a random 
Gaussian input pulse, where all eigenstates are localized 
in the infinite pulse duration limit, the eigenvalues z and 
and the corresponding hz give a complete description of 
the signal. 

We were able to compute the distribution of bz result- 
ing from a random gaussian input analytically by using 
the definition of bz and expressing it in terms of the be- 
havior of the eigenf unctions at the edges of the pulse, 
which in turn we can express as sums of random vari- 
ables with calculable distributions. More specifically, the 
precise asymptotic conditions for the eigenfunctions are: 



*z(a;) 
*z(x) 



*.(x) ( J I e- 



as a; — > 00 



(19) 



where the two sets of solutions are related through an 



^ In the simplest case of a single localized eigenstate with z = 
^ + 17] and bz = |6|e"^, the corresponding soliton has amplitude 
5, velocity 2ri, initial "position" to = ^^'^ initial phase 

00 = (/> - TT 3. 
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S-matrix: 





( b{z) a(z) \ 






\ Kz) ) 





(20) 



with the a, &'s being the transmission and reflection co- 
efficients respectively. By taking into account the sym- 
metry of the problem under complex conjugation it is 
possible to show that a{z*) = a*(z) and b{z*) = ~b*{z), 
where the star (*) denotes the complex conjugate. 

When the above solutions correspond to a localized 
eigenfunction with eigenvalue z, the transmission coeffi- 
cient a{z) has to vanish at that z, making the two sets of 
solutions directly proportional: 



(21) 



where $2 and $2 are the admissible exponentially de- 
caying eigenfunctions for Im{z) > and Im{z) < 0, 
respectively. Then, for ry > we have from (|19|21|) 



*.(x) 








(22) 



Defining ^{x) = "i^zi—x) we can write: 



b ~ lim b{x), b{x) 



A{x) ' 



(23) 



where for convenience we have dropped the subscript z. 
The time evolution of In 6 is found from pUj) . 



iiuf + u*f 



d\nb 



with fix) = fix) = i^, uix) ^ ui-x), and 



(24) 



df 

dx 



Bf 
dx 



tf>i ' 

~2iz/ -I- iu* — iuf"^ 
-2izf — iu -f iu*p. 



(25) 



Eqs. (|24l25p can be used to write down a Fokker- 
Planck equation for the probability distribution of In bz . 
The stationary solution of this equation is the actual dis- 
tribution of Inbz in the infinite pulse limit, and it turns 
out to be uniform, as we would expect from the transla- 
tional invariance of the input signal. The precise way in 
which this happens can be found by using (I24p to express 
In bz as a sum of random variables with known limit dis- 
tribution. The details of this calculation together with 
the relevant references are included in the attached pa- 
per. The result is a Gaussian distribution for the real 



part of In bz in the case of an unpolarized incoming pulse, 
with zero mean and variance growing as: 



smh (^) 2t 



(26) 



where T is the duration of the incoming pulse and t is 
the inverse bandwidth. The imaginary part of In b is an 
angle and so, although it follows the same distribution as 
the real part, will due to periodicity become uniformly 
distributed in [0,2tt). 

For a completely polarized incoming pulse, the distri- 
bution of In \ bz\ will asymptotically follow a Cauchy dis- 
tribution scaling like T/t. Its statistical median will be 
zero by symmetry. The phase of b will again be uniform 
over [0,27r), although the mechanism is different in this 
case. For the special case of ^ = 0, the scale parameter of 
the Cauchy distribution can be calculated more explicitly 
to be 
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gr;/-D y 



(27) 



loiv/D) T 

where lo is a modified Bessel function of the first kind. 

OPTICAL FIBER CHANNEL CAPACITY 

Armed with the knowledge of the distributions of scat- 
tering data {z, bz} generated by a random Gaussian in- 
put signal, we address the question of the capacity, or 
spectral efficiency, of the optical fiber channel. Histor- 
ically, there has been a steady exponential increase in 
the transmission capacity of fiber optics communications 
systems, resulting mainly from higher transmission rates 
and the implementation of wavelength division multiplex- 
ing (WDM). WDM systems generally use binary on-off 
keying as their signaling scheme, thus limiting the spec- 
tral efficiency (bits transmitted per second per Hz of 
bandwidth) to 1 b/s/Hz. More complicated signaling 
schemes can go beyond this limit, and will eventually 
be required in order to make use of the potential of the 
optical fiber network. Some amount of interesting work 
has been done on the question of the capacity limits for 
WDM and related swtems, both analytical and numer- 
ical (see [S-IS, 13- 3] and references therein). In par- 
ticular, in [g'^the spectral efficiency predicted is about 
8.5bits/sec/Hz, in the absence of jitter. In our model 
we consider a single-channel mode instead of the multi- 
channel setup of WDM. This has the advantage of avoid- 
ing the multiplicative noise resulting from interference 
between different channels. The modulation of the very 
large bandwidth of optical fibers (^ 50 THz) is beyond 
the ability of current electronic equipment, but we are 
interested in the limits posed by the very physics of non- 
linearity, which future technological advances should be 
able to approach. 
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The use of a definite input signal (random Gaussian) 
provides a lower bound for the capacity of the system, 
since the Gaussian is not necessarily the optimal input 
distribution for nonlinear channels. As we said in the 
introduction, the method we use is a transformation to 
the field of eigenvalues of the Zakharov-Shabat operator, 
where the amplifier noise becomes additive. We consider 
a system consisting of a fiber span of total length L, and 
Na equally spaced identical amplifiers that compensate 
for the loss of strength in the signal while unavoidably in- 
jecting it with white noise (amplified spontaneous emis- 
sion noise or ASE noise). For long-range transmission 
systems L ~ 1000km and the inter- amplifier spacing is 
La 50 — 100km. We study the case of bright solitons 
only, since dark solitons are not used for large distance 
communications. Mathematically it is modeled by the 
addition of a term i X^^^^^i ^ — nLa) fn{t) on the rhs of 
Eq. ([2]) with the roles of space and time interchanged. 
The delta-correlated white noise added at each amplifier 
has noise strength j46|] 



hvprispiG - 1) 
GlnG 



{fn{t)f„,it')) =<J^S„,nSit-t'), 

I -r III It 

(28) 

where h is Planck's constant, vq is the carrier frequency, 
rjsp is the spontaneous emission factor and G — e"^" 
is the amplification factor (a = 0.2dB/km) 
causes a random shift in the eigenvalues: 



^out 



6z. 



The noise 



(29) 



As long as the signal to noise ratio, which can be in- 
creased by increasing the input power or reducing the 
distance between amplifiers, is large, Sz (or rather its 
variance) can be calculated using adiabatic perturbation 
theory: 



Sz = 



1 / im 



2i 



(30) 



The denominator here is the non-Hermitian norm of 
the eigenvector. The second moments of Sz, {Sz"^^ and 
(|<5zp) contain integrals of the form J V'lV'l that do not 
correspond to integrals of the motion and it is hard, if 
at all possible, to calculate them analytically. They can 
however be evaluated numerically, using the eigenstates 
corresponding to each eigenvalue ®. 

The rate (per Hz) of information transmission in our 
model (ignoring jitter for the moment) is given by: 



Since we work in the perturbative regime, we can consis- 
tently approximate H[zout] with H[zin], and because the 
noise is additive, H[zout\zin] = H[Sz\. Thus we arrive at 
a simplified formula for the rate; 

R = H[z,,,]-H[Sz]. (32) 

The entropy of Zi„ is calculated from the distribution 
of eigenvalues given in the previous section: P(^, r/) = 
P^i^Priiv)- Only positive values of ^ and rj correspond to 
independent eigenstates, the rest being "mirror images" 
of these ^ is uniformly distributed, with maximum 
value A = 2TrB, where B is the bandwidth, and rj is 
distributed according to: 

-■ (33) 



4 22coth 

Pniv) = 



D 



SI 

sinh^ (f ) 



The incoming entropy (per symbol, in nats) then is: 

H[^,V]= H[£.]+ H[r]] = In A + In (^^^ + 0.08 . (34) 

The entropy of the noise is computed in the Gaussian 
approximation * , as the logarithm of the covariance ma- 
trix of S^,Sr]. The distribution of the logarithm of eigen- 
values of this matrix for a value of D = 2 is shown in 
FIG.El 




FIG. 6: Distribution of the logarithm of the eigenvalues of the 
covariant matrix. The value of D is 2, the size of the system 
is L = 150 and the step size is 0.06. 

The entropy (per symbol) of the noise in the Gaussian 
approximation is: 



R = H[Zout] - {H[Zout\zin])r 



(31) 



H[Sz] =ln(^(27re)5 



(35) 



Note that in our setup this is true only for a random Gaussian 
incoming pulse, because /I'ti'^l is an integral of the motion for 
the NLS equation, and so the statistics are not altered by prop- 
agation. A different initial distribution of signals would require 
Optical Phase Conjugators to periodically restore the statistics. 



The eigenstates for 77 < are related to those with r] > simply 
by complex conjugation of the ZS eigensystem, while those with 
5 < have positive counterparts with ^' = i; + being the 
inverse bandwidth. 
* Note that this makes our lower bound computation more solid, 
since the Gaussian has maximum entropy for given variance. 
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where A is the geometric mean of the eigenvalues of the 
(2A^ X 27V, N being the number of eigenvalues) covariance 
matrix of S^i^Siji, dj. In FIG. [7] we see a plot of ^ for 
ten values of D. 




0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 5.5 



D 



FIG. 7: The geometric mean of the eigenvalues of the covari- 
ance matrix as a function of the signal strength D. The solid 
line is a linear fit. The size of the system is L = 150 and the 
step size is 0.06. 



just the eigenvalues {z}, but also the {Infe^}, so that the 
signal description is complete ^ . The evolution of In is 
given by 



(38) 

so that In 62 is affected by the noise in ^ and 77. There 
is also noise specific to Infe^ at each amplifier, but this 
is bounded and can be neglected for long-range trans- 
mission. To properly include the {In 62} in our scheme 
and get a reliable lower bound for the capacity we must 
ascertain their correlations, which we have found to be 
non-zero. We are currently attempting to find an ana- 
lytic expression for the correlations. If this proves im- 
possible, we shall compute their correlation matrix nu- 
merically starting from (1^ and use it to compute the 
incoming entropy. Since the {Info^} follow a Gaussian 
distribution, this does not spoil the lower bound feature. 



The linear fit gives A = QAla^D, and allows us to 
compute a first result for the spectral efficiency lower 
bound (again, ignoring jitter): 



-R = In — 



BGlnG 



Be/ 0.41Wysp(G- 1) 



P 



(36) 



where Pc, tc are the characteristic units of power and time 
used to normalize the units of the NLSE and they are 
inserted here to express the result in SI units. For a 
system with Na amplifiers, one needs to replace with 
Nacr'^. Plugging in typical values for the quantities in- 
volved {Pc = OmW,tc = 3 X 10-"s,G = 100, r],p = 
2,/i = 6.6 X 10-34 J . s,i/o = 2 X 10"i7z, = 10) we 
find: 



i? In (7 X 10"^ X B) bits/s/Hz 



(37) 



with the bandwidth B measured in Hz. Using all the 
available bandwidth for optical fibers, B = 50THz this 
yields a value of i? ~ 25bits/s/Hz, which is significantly 
higher than the predicted result for the case of WDM 
y. One thing to note about this result is that D alto- 
gether disappears, cancelling between the incoming en- 
tropy and the noise. This is because the variance of the 
eigenvalue shift is linear in D. This is in contrast to 
analogous results in WDM models, where increasing the 
signal strength also increases the multiplicative noise be- 
tween channels and the capacity falls after reaching a 
maximum for a finite value of the SNR [s- 11. 



However, this result is incomplete as long as it does 
not include the effects of jitter, which could significantly 
reduce the capacity. To do this we need to consider not 
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